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1 GCCACCGACA TCCGCCGCAA TGCTGTGTCT 
61 GCTCCTTCCC TCTCTCGCTG ATCGACCGAG 
121 TCGCCTGAGC GTGGTTTCTT CGCCGCGCCG 
181 CTCAGTTCCT GCGACTGCGC GAAAAAACAA 
241 CCACCTTCCT ATATATGATC TGGACCCTAA 
301 TAGGATAAAA AGATACCTCG ACCAGAAATG 
361 AGAATTTTCT AAAGGCTATT TGAAGTTTGG 
421 TCGTGAATGG GCGCCTGCTG CACAAGAAGC 
481 TGGTGCAAAA CACAAGATGG AGAAGGATAA 
541 TGTCAATGGG AAGCCTGCCA TCCCTCACAA 
601 GGGTGGAGCA TGGGTTGATC GTATTCCCGC 
661 TAAATTTGGA GCTCCATATG ATGGTGTACA 
721 GTTTAAGCAT CCTCGACCTC CAAAACCTGA 
781 GATGAGTGGT GAAGAGCCAG AAGTAAGCAC 
841 ACGCATACGG GCAAATAACT ACAACACAGT 
901 CTATGCTTCT TTTGGGTATC ACGTGACAAA 
961 ACCAGAGGAT CTGAAATATC TTGTTGACAA 
1021 GGATGTTGTC CATAGCCATG CGAGTAATAA 
1081 TGGACAAAAC ACTCATGAGT CTTATTTTCA 
1141 GGATAGTCGT CTGTTCAACT ATGCCAATTG 
1201 GAGATATTGG ATGGACGAAT TCATGTTTGA 
1261 GCTATACCAT CACCATGGTA TCAATAAGGG 
1321 TTTGGATACC GATGTGGATG CAATTGTTTA 
1381 ACTCTTGCCG GAAGCAACTA TTGTTGCTGA 
1441 GCCAGTTGAT GAAGGTGGAG TAGGGTTTGA 
1501 ATGGATTGAC TACCTGAAGA ACAAAGAGGA 
1561 AACTTTGACT AACAGGAGAT ATACAGAAAA 
1621 GTCCATTGTT GGTGACAAGA CTATAGCATT 
1681 CATGTCAGAC TTGCAGCCTG CTTCACCTAC 
1741 GATTCACTTC ATTACGATGG CCCTTGGAGG 
1801 GTTTGGCCAT CCAGAATGGA TTGACTTTCC 
1861 ATGCAGACGT CAGTGGAGCC TTGTCGACAC 
1921 ATTTGATCAA GCAATGAATG CACTCGAGGA 
1981 GATTGTTAGC GACATGAACG AGAAAGATAA 
2041 TTTTGTTTTC AATTTTCATC CCAACAAAAC 
2101 GCCCGGGAAG TACAGAGTAG CTCTGGACTC 
2161 AGTTGGCCAT GATGTGGATC ACTTCACGTC 
2221 AAATTTCAAC AACCGCCCTA ACTCATTCAA 
2281 TTACTATCGC GTTGATGAAG ATCOTGAAGA 
2341 AAAGATTGTT ACAGAGTATA TCGATGTTGA 
2401 CTGGAAGGGC TCCGAGAAGG ACGATTGTGG 
2461 TTCTGACGAA GACTGCAAAT GAAGCATCAG 
2521 CTTGTAATCT GGAGATCCTG GCTTGCCTTG 
2581 TGTACCTATC TATGATATGA ACTTTATGTA 
2641 ATGATGTGGC CTTAAACCTG AGCTGCACAA 
2701 CATCCAGAAT AAAACAGCTG TTCATTTACC 



CACCTCCTCT TCCTCCTCCG CGCCCGCTCC 
CCCGGGAATC GCGGGCGGGG GTGGCAATGT 
GTCGTGGCCT GGAAAGGTCA AGACCAATTT 
AACCATGGTG ACTGTTGTGG AGGAGCTCGA 
GTTGGAGGAA TTCAAGGATC ACTTCAACTA 
CCTGATTGAA AAACATGAGG GGGGCCTTGA 
GATTAATACA GTTGATGGTG CCACAATATA 
ACAGCTCATT GGTGAGTTCA ATAACTGGAA 
ATTTGGCATT TGGTCAATCA AGATTTCACA 
TTCCAAGGTT AAATTTCGCT TTAGGCATGG 
ATGGATTCGT TATGCAACTT TTGATGCCTC 
CTGGGATCCT CCAGCCTGTG AAAGGTACGT 
TGCTCCACGC ATCTATGAGG CTCATGTGGG 
ATACAGAGAA TTTGCAGACA ATGTGTTACC 
TCAGTTAATG GCAATCATGG AACATTCCTA 
TTTTTTCGCA GTCAGCAGCA GATCAGGAAC 
GGCACATAGT TTAGGATTAC GAGTTCTGAT 
TGTGACCGAT GGTCTAAATG GCTATGACGT 
TACAGGAGAT AGGGGCTACC ATAAACTCTG 
GGAGGTCTTA AGATTTCTTC TTTCTAATTT 
TGGCTTCCGA TTTGATGGGG TTACATCAAT 
ATTTACTGGA AACTACAAGG AGTATTTCAG 
CATGATGCTC GCAAACCATT TAATGCATAA 
AGATGTTTCG GGCATGCCAG TGCTTTGTCG 
CTTCCGCCTG GCAATGGCCA TTCCTGATAG 
CCGCAAATGG TCAATGAGTG AAATAGTGCA 
ATGCATTGCC TATGCCGAGA GCCATGATCA 
TCTCTTGATG GACAAGGAAA TGTACACTGG 
CATCAACCGT GGCATTGCAC TCCAAAAGAT 
TGATGGCTAC TTAAATTTTA TGGGCAATGA 
AAGAGAAGGC AACAACTGGA GCTATGATAA 
TGATCACCTT CGATACAAGT ATATGAATGC 
GGAATTTTCC TTCCTGTCAT CATCAAAGCA 
GGTTATTGTC TTTGAACGTG GAGATTTGGT 
TTACAAGGGT TACAAAGTCG GATGTGACTT 
TGATGCTTTG GTCTTTGGTG GCCATGGAAG 
TCCCGAGGGA ATGCCAGGAG TACCAGAAAC 
AGTCCTTTCC CCGCCCCGTA CCTGTGTGGC 
GCTCAGGAGG GGTGGAGCAG TTGCTTCTGG 
AGCAACAAGT GGGGAGACTA TCTCTGGTGG 
CAAGAAAGGG ATGAAGTTTG TGTTTCGGTC 
ATTTCTTGAT CAGGAGCAAC TGTTGGTGCC 
GACTTGGTTG TGGTTCTTTA GCAGTTGCTA 
TAGTTCGCCT TAAAGAAAGA ATAAGCAGTG 
GCCTAATGTA AAAATAAAGT TTCAGGCTTT 
ATCTCAAAA 
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1 CTTGACTCCC CCCACTCCTC CCTCGTGCTG 
61 CATTTGCGGC GGGAGGGATC TGCGCGCGAG 
121 ACCGGGGGAT GGCGTCGTTC GCGGTGTCCG 
181 GCGGCGGCGG CGGCGGGGGT GGCCCGGCGG 
241 TGCTCTTCAG GAGGAAGGAC TCCTTCTCAC 
301 GGAAGGTGCT GGTGCCTGGC GGTGGGAGCG 
361 TGGAAACTCA AGAGCAACCT GAAGAATCTC 
421 TTGAGGAGGA GGAAGAGATT CCAGCAGTGG 
481 ACAAACTTGA ATCTTCAGAA GTGATTCAAG 
541 TCAAAGATGC TGATGAACCA ACTGTGGAGG 
601 ATGGGCAGAA GATATACCAA ATTGACCCAA 
661 ACCGATACAG TGAATACAAG AGAATGCGTG 
721 ATGCATTTTC TCGTGGTTAC GAAAAGCTTG 
781 ACCGAGAATG GGCACCTGGA GCACAGTCTG 
841 ACCCAAATGC AGATACTATG ACCAGAAATG 
901 ACAATGCTGA TGGATCCCCT GCTATTGCTC 
961 CACCATCTGG CGTAAAGGAT TCAATTCCTG 
1021 GTGAAATACC GTACAACGGT ATATATTATG 
1081 AACATCCTCA ACCTAAACGA CCAAATTCGC 
1141 GTAGCCCGGA ACCGAAGATA AACACATATG 
1201 TTAAAAAGCT TGGGTACAAT GCTGTACAGA 
1261 CAAGCTTTGG GTATCATGTT ACTAACTTCT 
1321 AAGACTTGAA ATCTCTGATT GATAAAGCTC 
1381 TTGTTCACAG TCATGCATCA AACAATACCC 
1441 ATACACATTA CTTCCATGGT GGACCACGGG 
1501 TCAACTATGG GAGTTGGGAA GTTTTAAGAT 
1561 AAGAATACAA GTTTGATGGG TTTCGATTTG 
1621 ATGGTTTACA GGTGGCATTT ACTGGCAACT 
1681 TTGATGCAGT AGTTTACTTG ATGCTGGTGA 
1741 CTGTAGCCAT TGGTGAAGAT GTCAGCGGGA 
1801 GTGGTGTTGG TTTTGACTAT CGTTTGCATA 
1861 TCAAGCAAAG TGACGAATAT TGGAAAATGG 
1921 GGTGGTCAGA GAAGTGTGTT ACTTATGCAG 
1981 AGACTATTGC ATTCTGGTTG ATGGATAAGG 
2041 CTTCAACACC TCGCATTGAT CGTGGGATAG 
2101 TGGGCTTAGG AGGCGAAGGC TATCTTAATT 
2161 GGATAGATTT CCCAAGAGGC CCGCAAAGTC 
2221 ACTACAGTTT TGATAAATGC CGTCGTAGAT 
2281 ATCATGGTAT GCAAGAGTTT GATCAGGCCA 
2341 TGACATCTGA GCACCAGTAT ATATCGCGCA 
2401 AGAGAGGAGA TTTGGTATTC GTGTTCAACT 
2461 GCGTCGGTTG TTTAAAGCCT GGAAAGTACA 
2521 TTGGTGGATT CAGTCGGCTT GATCATGATG 
2581 ACAACAGACC ATGTTCATTC TCGGTGTACA 
2641 TTACAGAGGA CTAATGATCA GCTCTGATCA 
2701 AATGACGCCG GAATACAACT CAAGTGAAAG 
2761 TTTGAGGGGC TTGTGTTTCA TCGCCAATGC 
2821 GAGCTCCTCT TTTCGAGTGA CTCGTGAAGG 
2881 AGCTATACCA AACCCATCCT ATGTTGCGCA 
2941 GTTGGCATTT GTATGTTTAT GAATAATCTG 
3001 AGTGCTCAAA AAAAA 



CTCCTCCTCG TCGCTCGGCT CGAGGCGCGG 
TGCGTGCGGG CAGGCGGCGG GGGAGCACGC 
GCGCGAGGCT CGGGGTCGTG CGGGCGGGGG 
CGCGATCCGG CGGGGTGGAC TTGCCGTCGG 
GTGGCGTTGT GAGCTGCGCG GGTGCTCCTG 
ACGACTTGCT CTCCTCTGCG GAACCAGACG 
AGATACCTGA TGATAATAAA GTAAAACCTT 
CAGAAGCAAG CATAAAGGTT GTGGCTGAAG 
ACATTGAGGA AAATGTGACT GAGGGTGTGA 
ATAAACCACG AGTTATCCCA CCACCAGGAG 
TGCTGGAAGG ATTTCGGAAC CATCTTGACT 
CAGCTATTGA CCAACATGAA GGTGGCTTGG 
GATTCACCCG CAGCGCTGAA GGCATTACCT 
CAGCATTAGT AGGTGACTTC AACAATTGGA 
AGTATGGTGT TTGGGAGATT TCCCTGCCTA 
ATGGCTCACG TGTAAAGATT CGGATGGATA 
CCTGGATTAA GTTTGCTGTG CAGGCTCCAG 
ATCCACCTGA AGAAGAAAAA TATGTATTCC 
TGCGGATATA TGAATCACAT ATTGGAATGA 
CTAATTTTAG GGATGAGGTG CTACCAAGAA 
TAATGGCAAT CCAGGAGCAC TCTTATTACG 
TTGCGCCAAG TAGCCGTTTC GGAACCCCAG 
ACGAGCTTGG TTTGCTTGTA CTTATGGATA 
TGGATGGTTT GAATGGTTTT GATGGTACTG 
GTCATCACTG GATGTGGGAT TCTCGCCTGT 
ATTTACTGTC GAATGCAAGG TGGTGGCTTG 
ATGGGGTGAC CTCCATGATG TATACTCATC 
ATGGCGAATA TTTTGGATTT GCTACTGATG 
ACGATCTAAT TCATGGGCTT TATCCTGAGG 
TGCCCACATT TTGTATTCCT GTTCAAGATG 
TGGCTGTACC GGACAAATGG ATCGAACTCC 
GTGATATCGT GCACACCCTA ACGAATAGAA 
AAAGTCATGA CCAAGCACTA GTTGGTGACA 
ATATGTATGA TTTTATGGCT CTAGACAGAC 
CATTACATAA AATGATTAGG CTTGTCACCA 
TCATGGGAAA TGAGTTTGGG CATCCTGAAT 
TTCCAAATGG CTCGGTCCTC CCAGGAAACA 
TTGACCTTGG AGATGCAGAT TATCTTAGAT 
TGCAGCATCT TGAGGAAAAA TATGGATTCA 
AACACGAGGA GGATAAGGTG ATCATCTTCG 
TCCACTGGAG TAATAGCTAT TTTGACTATC 
AGATTGTGTT GGACTCAGAC GATGGCCTCT 
CTGAGTACTT CACTGCTGAC TGGCCGCATG 
CCCCAAGCAG AACCGCCGTC GTGTATGCAC 
TTGGGGGAAC AACTCAAGGG AGTTGGTGGT 
GTGAAAAGAA AGGCTGCCCT GACGATGTGA 
CAGGAAGATG AGGTAGAAAA GCCTACTGAT 
AAATAGACCA GGGTGAACGG CTTTTTTCAG 
TTCGCTGTAG TTTTGTACAT AACGATATCG 
TTCGACAGAA ATGTTTTTCT CCTTGTATTT 
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1 CGGCGCACAC CCACACACCG ACCACCAGGC AGCGCCTCCT CGCTTTGGCT CTCGCGTGAG 
61 GAGGGTTTAG GTGGAAGCAG AGCGCGGGGG TTGCCGGGGG ATCCGATCCG GCTGCGGTGC 
121 GGGCGAGATG GCGGCGCCGG CGTCTGCGGT TCCCGGGAGC GCGGCGGGGC TACGGGCGGG 
181 GGCCGTGCGG TTCCCCGTGC CAGCCGGGGC CCGGAGCTGG CGTGCGGCGG CGGAGCTCCC 
241 GACGTCGCGG TCGCTGCTCT CCGGCCGGAG ATTCCCCGGT GCCGTTCGCG TGGGGGGTTC 
301 CGGGGGGCGC GTGGCCGTGC GCGCGGCGGG CGCGTCAGGG GAGGTGATGA TCCCCGAGGG 
361 CGAGAGCGAC GGGATGCCGG TTTCAGCAGG TTCAGACGAT CTGCAGTTGC CAGCCTTAGA 
421 TGATGAATTA AGCACGGAGG TTGGAGCTGA AGTTGAGATT GAGTCATCTG GAGCAAGTGA 
481 CGTTGAAGGC GTGAAGAGAG TGGTTGAAGA ATTAGCTGCT GAGCAGAAAC CACGAGTTGT 
541 CCCACCAACA GGAGATGGGC AAAAAATATT CCAGATGGAC TCTATGCTTA ATGGCTATAA 
601 GTACCATCTT GAATATCGAT ATAGCCTATA TAGGAGACTG CGTTCAGACA TTGATCAGTA 
661 TGAAGGAGGA CTGGAAACAT TTTCTCGCGG TTATGAGAAG TTTGGATTTA ATCACAGTGC 
721 TGAAGGTGTC ACTTATCGAG AATGGGCTCC CGGGGCACAT TCTGCAGCAT TAGTAGGTGA 
781 CTTCAACAAT TGGAATCCAA ATGCAGACCG CATGAGCAAA AATGAGTTTG GTGTTTGGGA 
841 GATTTTTCTG CCTAACAATG CTGATGGCTC ATCTCCTATT CCACATGGCT CACGTGTAAA 
901 GGTGCGAATG GAAACTCCAT CTGGTATAAA GGATTCTATT CCTGCCTGGA TCAAGTACTC 
961 TGTGCAGGCC GCAGGAGAAA TCCCATACAA TGGAATATAT TATGATCCTC CTGAAGAGGA 
1021 GAAGTACATA TTCAAGCATC CTCAACCTAA AAGACCAAAG TCATTGCGGA TATACGAAAC 
1081 TCATGTTGGA ATGAGTAGCA CGGAGCCAAA GATCAACACG TATGCAAACT TTAGGGATGA 
1141 GGTGCTTCCA AGAATCAAAA AGCTTGGATA CAATGCAGTG CAAATAATGG CAATTCAAGA 
1201 GCATGCATAT TATGGAAGCT TTGGGTACCA TGTCACCAAT TTCTTTGCAC CAAGTAGTCG 
1261 TTTCGGGACC CCAGAAGATT TAAAGTCATT GATTGATAAA GCTCATGAGC TTGGTTTAGT 
1321 TGTGCTCATG GATGTTGTTC ACAGCCATGC GTCAAATAAT ACCCTAGATG GGTTGAACGG 
1381 TTTTGATGGT AC AGAT AC GC ATTACTTTCA TAGTGGTTCA CGCGGCCATC ATTGGATGTG 
1441 GGATTCTCGC CTTTTCAACT ATGGGAATTG GGAAGTTCTA AGATTTCTAC TATCCAATGC 
1501 AAGATGGTGG CTCGAGGAGT ATAAGTTTGA TGGTTTCAGA TTTGACGGTG TAACCTCAAT 
1561 GATGTACACT CATCATGGAT TACAAGTAGC ATTTACGGGG AACTACAGTG AATACTTTGG 
1621 ATTTGCCACT GATGCTGATG CAGTAGTTTA CTTGATGCTG GTAAATGATT TAATTCATGG 
1681 ACTTTATCCT GAGGCCATAA CCATCGGTGA AGATGTCAGT GGAATGCCTA CATTTGCCCT 
1741 TCCTGTTCAA GATGGTGGGG TTGGTTTTGA TTATCGCCTT CATATGGCTG TTCCTGACAA 
1801 ATGGATTGAA CTCCTCAAGC AAAGTGATGA ATCTTGGAAG ATGGGTGATA TTGTGCACAC 
1861 ACTGACTAAC AGAAGGTGGT CAGAGAAGTG TGTTACTTAT GCTGAAAGTC ATGATCAAGC 
1921 ACTAGTTGGT GACAAAACTA TTGCATTCTG GTTGATGGAC AAGGATATGT ATGATTTTAT 
1981 GGCTCTGGAC AGACCGGCAA CACCTAGCAT TGATCGTGGA ATAGCATTGC ATAAAATGAT 
2041 TAGACTTATC ACAATGGGGT TAGGAGGAGA AGGCTATCTT AACTTTATGG GAAATGAGTT 
2101 CGGACATCCT GAATGGATTG ATTTTCCAAG AGCTCCACAA GTACTTCCAA ATGGTAAATT 
2161 CATCCCAGGG AATAACAACA GTTATGATAA ATGCCGTCGA AGATTTGACC TGGGTGATGC 
2221 GGACTATCTT AGGTATCGTG GCATGCTAGA GTTTGACCGC GCGATGCAGT CTCTCGAGGA 
2281 AAAATATGGG TTCATGACAT CAGACCACCA GTACATATCT CGAAAGCATG AAGAGGATAA 
2341 GATGATTATA TTTGAGAAGG GAGATCTGGT ATTTGTGTTC AACTTCCATT GGAGTAACAG 
2401 CTATTTTGAC TACCGTGTTG GTTOTTTAAA GCCAGGGAAA TATAAGGTGG TCTTGGACTC 
2461 AGATGCTGGA CTCTTTGGTG GATTTGGCAG GATCCATCAC ACTGCAGAGC ACTTCACTGC 
2521 CGATTGTTCA CATGACAACA GGCCCTACTC GTTCTCAGTT TATTCTCCTA GCAGAACCTG 
2581 CGTTGTCTAT GCTCCAGCGG AATGAGAACA CCAAGAGGCA GCATGCAAGT GTGTGCGGCT 
2641 GCTAGTGCGA AGGAGCAAGA AAAACTAGTT GCCAGCAATC TGTGAACGGC TTTCCTAGGT 
2701 TCTGCTTCGA TGAATGCCGG ATAGACTAGA CAGCTTGCTT TTGTGCTTTG CGCTCCCAAT 
2761 TTGTAGTTTT AGTTTGTGAG GGAAAGAAAC GTTTATTTGT AATTATCTAT GGCTGTCGAA 
2821 CGGCGACGAA ACCATGAACC CCGTATATTT GTTGGTACCG TTCGAACTGC CAGTTATACA 
2881 TAGTTCTGCA CTTCTGTACA TCTTGTGATG CTTGAATC 
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£pnI-6S7.G CTAC'C 
Xbal- 661 - "FCTAG A 
Spel - 682 - A"CTAG T 



SwRI- 1172 - G'AATTC 
Xkol- 1178 - CTCGA_G 
EamHl- 1190 - G"GATC_C 
1200 - G_GTAC'C 
Afcff- 1209-GC'GGCC GC 
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Promoter Exon (sense) 



Intron Exon (antisense) Terminator 



Linear structure 



_ Exon (sense) tmr*T 



Duplex formation Exon («■«■«•■•> 






itfriI-2151 - GC'GGCCGC 
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NoU- 13 - GC'GGCCGC 
I fUnSm - 39 - A'AGCTT 




ffoiSm - 2297 - A'AGCTT 
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82 GCGCGGGGGTTGCCGGGGGATCCGATCCGGCTGCG . GTGCGGGCGAGATG 130 

MINI III! I Mil IMI I III Mill Ml I I 

55 gcgcggcatttgcggcggga.gggatctgcgcgcgagtgcgtgcgggcag 103 
131 GCGGC GCCGGCGTCTGCGGTTCCCGGGA 158 

Mill I II III Mill II II 

104 gcggcgggggagcacgcaccgggggatggcgtcgttcgcggtgtcc.ggc 152 
159 GCGCGGCGGGGCTACGGGCGGGGGCCGTGCGGTTCCCCGTGCCAGCCGGG 208 

III III II II Mill I I Mil I I I I II III 

153 gcgaggctcggggtcgtgcgggcggggggcgg cggcggcggcgggg 198 

• • • • • 

209 GCCCGGAGCTGGCGTGCGGCGGCGGAGCTCCCGACGTCGCGGTCGCTGCT 258 

I I I II I I llllll I I I II 1 1 1 1 1 1 1 1 

199 gtggcccggcggcgcgatccggcgggg . . . • tggacttgccgtcggtgct 244 

• • • • • 

259 CTCCGGCCGGAGATTCCCCGGTGCCGTTCGCGTGGGGGGTTCCGGGGGGC 308 

till I I I II 1 III I llllll 

i i i i iii ii i mi i iiiiii 

245 cttcaggagga aggactccttctcacgtggcgtt 278 

• • • • ■ 

309 GCGTGGCCGTGCGCGCGGCGGGCGCGTCAGGGGAGGTGATGATCCCCGAG 358 

III II Mill II I III Mill II I II I 

279 gtgagctgcgcgggtgctcctgggaaggtgctggtgcctggc 320 

• • • • • 

359 GGCGAGAGCGACGGGATGCCGGTTTCAGCAGGTTCAGACG 398 

II I MINIM III I II II I llllll 

321 ggtgggagcgacgacttgctgtcctctgcggaaccagacgtggaaactca 370 

■ • ■ • • 
399 ATCTGCAGTTGCC AGCCT 416 

MM III I II I III 

371 agagcaacctgaagaatct . cagatacctgatgataataaagtaaaacct 419 

• • • • • 

417 T AGATGATGAATTAAGCACGGAGGT 441 

I II I III Mill MM 

420 tttgaggaggaggaagagattccagcagtggcagaagcaagcataaaggt 469 

• • • • ■ 

442 TGGAGCTGAAGTTGAGATTGAGTCATC TGGAG 473 

II lllllll I MM II II II I 

470 tgtggctgaagacaaacttgaatcttcagaagtgattcaagacattgagg 519 

• ■ • • • 

474 CAAGTGACGTTGAAGGCGTGAAGAGAGTGGTTGAAGAATTAGCTGCTGAG 523 

II II III II MM I II I III III I III III 

520 aaaatgtgactgagggtgtgatcaaagatgctgatgaaccaactgtggag 569 

• • • • • 

524 CAGAAACCACGAGTTGTCCCACCAACAGGAGATGGGCAAAAAATATTCCA 573 

I 1 1 1 1 1 1 1 1 1 1 1 1 MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM III 

570 gataaaccacgagttatcccaccaccaggagatgggcagaagatatacca 619 

• • • • • 

574 GATGGACTCTATGCTTAATGGCTATAAGTACCATCTTGAATATCGATATA 623 

II III I Mill I II I I I I I I I I I I I I I II Mill I 

620 aattgacccaatgctggaaggatttcggaaccatcttgactaccgataca 669 



624 GCCTATATAGGAGACTGCGTTCAGACATTGATCAGTATGAAGGAGGACTG 673 

I III I MM Mill Ml Mill II llllll II II 

670 gtgaatacaagagaatgcgtgcagctattgaccaacatgaaggtggcttg 719 
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674 GAAACATTTTCTCGCGGTTATGAGAAGTTTGGATTTAATCACAGTGCTGA 723 

ii iiiiiiiiii inn ii in mini i i in mil 7fiq 

720 gatgcattttctcgtggttacgaaaagcttggattcacccgcagcgctga 769 

724 AGGTGTCACTTATCGAGAATGGGCTCCCGGGGCACATTCTGCAGCATTAG 773 

IN | || II I HUM II II II II INN I I Mill Ml III 
770 aggcattacctaccgagaatgggcacctggagcacagtctgcagcattag 819 

774 TAGGTGACTTCAACAATTGGAATCCAAATGCAGACCGCATGAGCAAAAAT 823 

ii iii iii iii ii i him i ii ii in 1 1 mi i 1 1 1 1 ii mi oca 

820 taggtgacttcaacaattggaacccaaatgcagatactatgaccagaaat 869 
824 GAGTTTGGTGTTTGGGAGATTTTTCTGCCTAACAATGCTGATGGCTCATC 873 

1 1 1 1 1 1 MM Mill IN I i llllllllllllllllllll II I 

870 gagtatggtgtttgggagatttccctgcctaacaatgctgatggatcccc 919 
874 TCCTATTCCACATGGCTCACGTGTAAAGGTGCGAATGGAAACTCCATCTG 923 

I Illllll MIIMIMIIIMIIII I II Mill II mini 

920 tgctattcctcatggctcacgtgtaaagattcggatggatacaccatctg 969 
924 GTATAAAGGATTCTATTCCTGCCTGGATCAAGTACTCTGTGCAGGCCGCA 973 

i iiiiiiiiii iiiiiiiiiiim mi iiiiiiiiii n 

970 gcgtaaaggattcaattcctgcctggattaagtttgctgtgcaggctcca 1019 
974 GGAGAAATCCCATACAATGGAATATATTATGATCCTCCTGAAGAGGAGAA 1023 

ii inn ii inn ii iiiiiimmii imiiii ii ii 

1020 ggtgaaataccgtacaacggtatatattatgatccacctgaagaagaaaa 1069 

• • • • 

1024 GTACATATTCAAGCATCCTCAACCTAAAAGACCAAAGTCATTGCGGATAT 1073 

ii inn i miimmim mini n imiiiii 

1070 atatgtattccaacatcctcaacctaaacgaccaaattcgctgcggatat 1119 

. . • • • 

1074 ACGAAACTCATGTTGGAATGAGTAGCACGGAGCCAAAGATCAACACGTAT 1123 

i in i iii iiiiiMiiiMii mi ii inn urn m 

1120 atgaatcacatattggaatgagtagcccggaaccgaagataaacacatat 1169 

♦ • • • • 

1124 GCAAACTTTAGGGATGAGGTGCTTCCAAGAATCAAAAAGCTTGGATACAA 1173 

II II ! 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 IMIIIII MIIMMIM Mill 

1170 gctaattttagggatgaggtgctaccaagaattaaaaagcttgggtacaa 1219 

• • • • • 

1174 TGCAGTGCAAATAATGGCAATTCAAGAGCATGCATATTATGGAAGCTTTG 1223 

iii ii ii minimi n inn i inn i imiiii 

1220 tgctgtacagataatggcaatccaggagcactcttattacgcaagctttg 1269 
1224 GGTACCATGTCACCAATTTCTTTGCACCAAGTAGTCGTTTCGGGACCCCA 1273 

mi inn ii ii imiiiii imiiiii 1 1 ii 1 1 1 1 mm 

1270 ggtatcatgttactaacttctttgcgccaagtagccgtttcggaacccca 1319 

• • • • • 
1274 GAAGATTTAAAGTCATTGATTGATAAAGCTCATGAGCTTGGTTTAGTTGT 1323 

Mill II II II I II I MM Ml II I II 1 1 1 1 1 ! 1 1 1 1 1 MM 

1320 gaagacttgaaatctctgattgataaagctcacgagcttggtttgcttgt 1369 

■ • • • • 

1324 GCTCATGGATGTTGTTCACAGCCATGCGTCAAATAATACCCTAGATGGGT 1373 

ii mill iiiiiiiiii inn inn imiiii inn i 

1370 acttatggatattgttcacagtcatgcatcaaacaataccctggatggtt 1419 
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• «■ " 

1374 TGAACGGTTTTGATGGTACAGATACGCATTACTTTCATAGTGGTTCACGC 1423 

| | | | llllllllllllll Dili llllllll III Mil MM 
1420 tgaatggttttgatggtactgatacacattacttccatggtggaccacgg 1469 

1424 GGCCATCATTGGATGTGGGATTCTCGCCTTTTCAACTATGGGAATTGGGA 1473 

II Mill llllllllllllllllllll lllllllllllll llllli _ o 

1470 ggtcatcactggatgtgggattctcgcctgttcaactatgggagttggga 1519 

1474 AGTTCTAAGATTTCTACTATCCAATGCAAGATGGTGGCTCGAGGAGTATA 1523 

| I I I llllli I Mil II llllllll llllllll M IIIII ^ 
1520 agttttaagatatttactgtcgaatgcaaggtggtggcttgaagaataca 1569 

1524 AGTTTGATGGTTTCAGATTTGACGGTGTAACCTCAATGATGTACACTCAT 1573 

II I M I I II I II MMIM II II Mill IIIMIII MUM 
1570 agtttgatgggtttcgatttgatggggtgacctccatgatgtatactcat 1619 

1574 CATGGATTACAAGTAGCATTTACGGGGAACTACAGTGAATACTTTGGATT 1623 

l I 1 I l I I I II II llllllll II Mill I IIIII llllllll 
I I I M I I I I I M i I i i i i i i ii iiiii i i i i i iii 

1620 catggtttacaggtggcatttactggcaactatggcgaatattttggatt 1669 

• • • 

1624 TGCCACTGATGCTGATGCAGTAGTTTACTTGATGCTGGTAAATGATTTAA 1673 

Ml lllllll MMMMIMMMMMMMMII II III III 

1670 tgctactgatgttgatgcagtagtttacttgatgctggtgaacgatctaa 1719 
1674 TTCATGGACTTTATCCTGAGGCCATAACCATCGGTGAAGATGTCAGTGGA 1723 

lllllll llllllllllllll II MM llllllllllllll II 

1720 ttcatgggctttatcctgaggctgtagccattggtgaagatgtcagcggg 1769 

• • • • 

1724 ATGCCTACATTTGCCCTTCCTGTTCAAGATGGTGGGGTTGGTTTTGATTA 1773 

Mill MUM IIIMIIIIIIMIIIIII MIMIMIM II 

1770 atgcccacattttgtattcctgttcaagatggtggtgttggttttgacta 1819 
1774 TCGCCTTCATATGGCTGTTCCTGACAAATGGATTGAACTCCTCAAGCAAA 1823 

III I MIMIMIM II MIMIMIM 1 1 1 M I ! 1 1 II 1 1 1 1 1 

1820 tcgtttgcatatggctgtaccggacaaatggatcgaactcctcaagcaaa 1869 

• • ■ • • 
1824 GTGATGAATCTTGGAAGATGGGTGATATTGTGCACACACTGACTAACAGA 1873 

MM I II I llllli MIMIMIM llllllll II II II III 

1870 gtgacgaatattggaaaatgggtgatatcgtgcacaccctaacgaataga 1919 

. . • • • 

1874 AGGTGGTCAGAGAAGTGTGTTACTTATGCTGAAAGTCATGATCAAGCACT 1923 

lllllllllllllllllllllllllllll MMIMIMI llllllll 

1920 aggtggtcagagaagtgtgttacttatgcagaaagtcatgaccaagcact 1969 

. • • • • 

1924 AGTTGGTGACAAAACTATTGCATTCTGGTTGATGGACAAGGATATGTATG 1973 

Illlllllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 lllllllllllll 

1970 agttggtgacaagactattgcattctggttgatggataaggatatgtatg 2019 

« • • • • 

1974 ATTTTATGGCTCTGGACAGACCGGCAACACCTAGCATTGATCGTGGAATA 2023 

lllllllllllll llllllll llllllll lllllllllllll III 

2020 attttatggctctagacagaccttcaacacctcgcattgatcgtgggata 2069 

• • • • * 
2024 GCATTGCATAAAATGATTAGACTTATCACAATGGGGTTAGGAGGAGAAGG 2073 

IIIII llllllllllllll III MM IIIII llllllll Mill 

2070 gcattacataaaatgattaggcttgtcaccatgggcttaggaggcgaagg 2119 
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2074 CTATOTTAACTTTATGGGAAATGAGTTCGGACATCCTGAATGGATTGATT 2123 

IIIIMIII II llllllllllllll II llllllllllllll mi 
2120 ctatcttaatttcatgggaaatgagtttgggcatcctgaatggatagatt 2169 



2124 TTCCAAGAGCTCCACAAGTACTTCCAAATGGTAAATTCATCCCAGGGAAT 2173 

I 1 1 1 I I I I II III Mlllllllll II lllllll II 

2170 tcccaagaggcccgcaaagtcttccaaatggctcggtcctcccaggaaac 2219 



2174 AACAACAGTTATGATAAATGCCGTCGAAGATTTGACCTGGGTGATGCGGA 2223 

Ml 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 i 1 1 1 lllllllllll II Mill II 

2220 aactacagttttgataaatgccgtcgtagatttgaccttggagatgcaga 2269 

a a a a 

2224 CTATCTTAGGTATCGTGGCATGCTAGAGTTTGACCGCGCGATGCAGTCTC 2273 

IMIIIII 1 1 1 1 III Mil lllllllll I II MUM II 

2270 ttatcttagatatcatggtatgcaagagtttgatcaggccatgcagcatc 2319 

• • • • • 
2274 TCGAGGAAAAATATGGGTTCATGACATCAGACCACCAGTACATATCTCGA 2323 

I MM!!!!!!!!!! !!!!!!!!!!! !! 1 1 1 i 1 1 1 1 !!!!! !! 

2320 ttgaggaaaaatatggattcatgacatctgagcaccagtatatatcgcgc 2369 

• • • » ■ 
2324 AAGCATGAAGAGGATAAGATGATTATATTTGAGAAGGGAGATCTGGTATT 2373 

II II II lllllllll MM II II MM llllll lllllll 

2370 aaacacgaggaggataaggtgatcatcttcgagagaggagatttggtatt 2419 

• • • a • 

2374 TGTGTTCAACTTCCATTGGAGTAACAGCTATTTTGACTACCGTGTTGGTT 2423 

1 1 1 1 1 1 1 1 1 1 1 1 M IMIIIII f 1 1 ! 1 1 1 1 1 1 1 ! 1 1 II II MM 

2420 cgtgttcaacttccactggagtaatagctattttgactatcgcgtcggtt 2469 



2424 GTTTAAAGCCAGGGAAATATAAGGTGGTCTTGGACTCAGATGCTGGACTC 2473 

llllllllll II ll ll ill l ll lllllllllll l III III 

2470 gtttaaagcctggaaagtacaagattgtgttggactcagacgatggcctc 2519 

• a a a a 

2474 TTTGGTGGATTTGGCAGGATCCATCACACTGCAGAGCACTTCACTGCCGA 2523 

lllllllllll I II I I I I I III III llllllllll II 
2520 tttggtggattcagtcggcttgatcatgatgctgagtacttcactgctga 2569 

a a a a * 

2524 TTGTTCACATGACAACAGGCCCTACTCGTTCTCAGTTTATTCTCCTAGCA 2573 

II I lllllllllll II I II Mill II II I II MM 

2570 ctggccgcatgacaacagaccatgttcattctcggtgtacaccccaagca 2619 

a • 

2574 GAACCTGCGTTGTCTATGCTC 2594 

Mill III II Mill I 

2620 gaaccgccgtcgtgtatgcac 2640 
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riceSBEIIaIR- seq 

1 CTCGAGTCTA GATCGCGTC G GTTGTTTA AA GCCTGGA AAG TACAAG ATTGT 

56 GTTGGACTC AGACGATGGC CTCTTTGGT G GATTCAGT CG GCTTGAT CATGA 

111 TGCTGAGT A CTTCACTGC TGACTGGCCG CATGACAAC A GACCATGT TCATT 

166 CTCGGTG TA CACCCCAA G CAGAACCGC CGTCGTGTAT GCACTTACA GAGGA 

221 CTAATG ATC AGCTCTG AT CATTGGGG G AACAACTCA AGGGAGTTGG TGGTA 

276 ATGAC GCCG GAATAC AAC TCAAGTG AA AGGTGAAA A GAAAGGCTGC CCTGA 

331 CGAT GTGAT TTGAG GGGC TTGTGT TTC ATCGCCA AT GCCAGGAAGA TGAGG 

386 TAG AAAAGC CTAC TGATG AGCTC CTGT TTTCGA GTG ACTCGTGAAG GAAAT 

441 AG ACCAGGG TGA ACGGCT TTTT TCAGA GCTAT ACCA AACCCATCCT ATGTT 

496 G CGCATTCG CT GTAGTTT TGT ACATAA CGAT ATCGG TTGGCATTTG TATGT 

551 TTATGAATA A TCTGTTCG AC AGAAATG TTT TTCTCC TTGTAACTAG TGAA 

606 TTC 



riceSBEIIblR. seq 

1 CTCGAGTCTA GNNNNNNNN N NNNNNNNN NN NNNNNNN NNN NNNNNN NNNNN 

56 NNNNNNNNN NNNNNNNNNN NNNNNNN NN N NNNNNNNN NN NNNNNNN NNNNN 

111 NNNNNNNN N NNNNNNNNN NNNNNNNNNN NNNNNNNNN N NNNNNNNN NNNNN 

166 NNNNNNN NN NNNNNNNN N NNNNNNNNN NNNNNNNNN G CTCCAGCGG AATGA 

221 GAACAC CAA GAGGCAG CA TGCAAGTG T GTGCGGCTG CTAGTGCGAA GGAGC 

27 6 AAGAA AAAC TAGTTG CCA GCAATCT GT GAACGGCT T TCCTAGGTTC TGCTT 

331 CGAT GAATG CCGGA TAGA CTAGAC ANN NNNNNNN NN NNNNNNNNNN NNNNN 

386 NNT TGTAGT TTT A GTTTG TGAGG GAAA GAAACG TTT ATTTGTAATT ATCTG 

441 TG GCTGTCG AAC GGCGAC GAAA CCATG AACCC CGTA TATTTGTTGG TACCG 

496 T TCGAACTG CC AGTTATA CAT AGTTCT GCAC TTCTG TACATCTTGT GATGC 

551 TACTAGTGA A TTC 



riceSBEIIR. seq 

1 CTCGAGTCTN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

56 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

111 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

166 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

221 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

276 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

331 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

386 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

441 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 

496 NNNNNNNNNN NNNNNNNAGC ATCAGATTTC TTGATCAGGA GCAACTGTTG GTGCC 

551 CTTGTAAACT AGTGAATTC 



Figure 8 



